SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

نویسندگان

چکیده

Arbitrary text appearance poses a great challenge in scene recognition tasks. Existing works mostly handle with the problem consideration of shape distortion, including perspective distortions, line curvature or other style variations. Rectification (i.e., spatial transformers) as preprocessing stage is one popular approach and extensively studied. However, chromatic difficulties complex scenes have not been paid much attention on. In this work, we introduce new learnable geometric-unrelated rectification, Structure-Preserving Inner Offset Network (SPIN), which allows color manipulation source data within network. This differentiable module can be inserted before any architecture to ease downstream tasks, giving neural networks ability actively transform input intensity rather than only rectification. It also serve complementary known transformations work both independent collaborative ways them. Extensive experiments show proposed transformation outperforms existing rectification has comparable performance among state-of-the-arts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition

Text recognition in natural scene is a challenging problem due to the many factors affecting text appearance. In this paper, we presents a method that directly transcribes scene text images to text without needing of sophisticated character segmentation. We leverage recent advances of deep neural networks to model the appearance of scene text images with temporal dynamics. Specifically, we inte...

متن کامل

Automatic Scene Text Recognition using a Convolutional Neural Network

This paper presents an automatic recognition method for color text characters extracted from scene images, which is robust to strong distortions, complex background, low resolution and non uniform lightning. Based on a specific architecture of convolutional neural networks, the proposed system automatically learns how to recognize characters without making any assumptions, without applying any ...

متن کامل

Accurate Scene Text Recognition Based on Recurrent Neural Network

Scene text recognition is a useful but very challenging task due to uncontrolled condition of text in natural scenes. This paper presents a novel approach to recognize text in scene images. In the proposed technique, a word image is first converted into a sequential column vectors based on Histogram of Oriented Gradient (HOG). The Recurrent Neural Network (RNN) is then adapted to classify the s...

متن کامل

Depth Structure Preserving Scene Image Generation

Key to automatically generate natural scene images is to properly arrange among various spatial elements, especially in the depth direction. To this end, we introduce a novel depth structure preserving scene image generation network (DSP-GAN), which favors a hierarchical and heterogeneous architecture, for the purpose of depth structure preserving scene generation. The main trunk of the propose...

متن کامل

Towards Text Recognition in Natural Scene Images

In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i4.16442